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Abstract 

We describe a measurement of the top quark mass from events produced in pp collisions at 
a center-of-mass energy of 1.96 TeV, using the Collider Detector at Fermilab. We identify ti 
candidates where both W bosons from the top quarks decay into leptons (eu, fiu, or tv) from a data 
sample of 360 pb^ 1 . The top quark mass is reconstructed in each event separately by three different 
methods, which draw upon simulated distributions of the neutrino pseudorapidity, ti longitudinal 
momentum, or neutrino azimuthal angle in order to extract probability distributions for the top 
quark mass. For each method, representative mass distributions, or templates, are constructed 
from simulated samples of signal and background events, and parameterized to form continuous 
probability density functions. A likelihood fit incorporating these parameterized templates is then 
performed on the data sample masses in order to derive a final top quark mass. Combining the three 
template methods, taking into account correlations in their statistical and systematic uncertainties, 
results in a top quark mass measurement of 170.1 ± 6.0 (stat.) ± 4.1 (syst.) GeV/c 2 . 

PACS numbers: 14.65Ha, 13.85. Qk, 13.85.Ni 
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I. INTRODUCTION 



The top quark, the weak isospin partner of the bottom quark, was first observed by the 
CDF and D0 collaborations in pp collisions produced at the Fermilab Tevatron During 
Run I operation from 1992 to 1995, CDF acquired 109 pb -1 of data at a center-of-mass 
energy of 1.8 TeV, and performed the first measurements of top quark properties. Since the 
start of Run II at the Tevatron in 2001, CDF has collected integrated luminosities several 
times that of Run I. Increased top production from a higher collision energy and improved 
acceptance of the upgraded detector have further enhanced the Run II top quark yield. This 
larger sample size allows for more precise studies of the characteristics of the top quark. 

As with all quarks, the top quark mass is not predicted by theory, and therefore represents 
a free parameter in the standard model which must be experimentally determined. Tevatron 

n 

Run I measurements yielded a top quark mass of 178.0 ± 4.3 GeV/c 2 |2j, approximately 40 
times heavier than the next heaviest quark, the bottom quark. Such a large mass, close 
to the electroweak symmetry breaking scale v = (V^Gf)' 1 ^ 2 ~ 246 GeV, suggests that the 
top quark may play a special role in this process |3j. The subsequently large contribution 
to quark-loop corrections of electroweak parameters from the heavy top quark provides for 
powerful tests of the standard model. In particular, a precise measurement of the top quark 
mass, coupled with that of the W boson, leads to tighter constraints on the as yet unobserved 
Higgs boson ^]. 

At the Tevatron, in pp collisions with a center-of-mass energy of 1.96 TeV, top quarks 
are produced mainly in ti pairs, through qq annihilation (~ 85%) and gluon-gluon fusion. 
Because of its large width and correspondingly short lifetime (~ 10~ 25 s), the top quark 
decays before any hadronization can take place, so that its existence as a "free quark" can 
be studied without the complication of lower energy QCD effects. In the framework of 
the standard model, each top quark decays almost exclusively to an on-shell W boson and 
a bottom quark. The b quark subsequently hadronizes into a jet of particles, while the W 
decays either to a qq' or a lepton-neutrino pair. Thus, the decays of the W bosons determine 
the characteristics of a ti event and, consequently, the event selection strategy. 

The "all hadronic" mode, where both Ws decay into qq' pairs, occurs for ~44% of ti 
events, but this topology is dominated by a large QCD multijet background. The most 
precise top quark mass measurements arise from the "lepton+jets" mode (~30% of events), 
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where one W decays hadronically while the other decays to either an electron or muon plus 
a neutrino, whose presence can be inferred from missing transverse energy in the detector. A 
third mode occurs when both W bosons from each top quark decay into leptons (eis, fiu, or 
tv). Though this "dilepton" mode accounts for only ~11% of tt events, such measurements 
are important in order to reduce the overall uncertainty on the top quark mass. Further, 
dilepton measurements test the consistency of top quark mass results obtained using other 
decay modes, as the dilepton mode contains different background sources and, therefore, 
represents an inherently different event sample. Since all top quark mass measurements 
assume a sample composition of tt and standard model background events, any discrepancy 
among the measured top masses could indicate the presence of new physics processes. 

This paper reports a measurement of the top quark mass with the CDF II detector by 
combining three analysis methods for the dilepton channel. Each analysis selects candidate 
tt dilepton decays using one of two complementary event selection strategies, which differ 
in lepton identification criteria and subsequent signal-to-background ratios. In each anal- 
ysis a single, representative top quark mass for each event is reconstructed using different 
kinematical assumptions in order to constrain the tt dilepton decay. The distributions of 
reconstructed top quark masses obtained from the data are compared with simulated mass 
distributions (templates) for signal and background events, and likelihood fits are used to 
arrive at a final top quark mass for each analysis technique. Accounting for correlations in 
statistical and systematic uncertainties, the results of the three analyses are then combined 
to determine the top quark mass in the dilepton channel using template methods. 



II. DETECTOR AND EVENT SELECTION 



The data sample used for these analyses was collected by the Collider Detector at Fer- 
milab |5j during Run II operation between March 2002 and August 2004. As depicted in 
Fig. m the CDF II detector is an azimuthally and forward-backward symmetric apparatus 
designed to study pp reactions at the Tevatron. We use a cylindrical coordinate system 
about the proton beam axis in which 9 is the polar angle, is the azimuthal angle, and 
pseudorapidity is defined as r\ = — ln[tan(0/2)]. The detector has a charged particle tracking 
system immersed in a 1.4 T magnetic field, aligned coaxially with the pp beams. The Run II 
Silicon Vertex Detector (SVX II) and Intermediate Silicon Layer (ISL) provide tracking 
over the radial range 1.5 to 28 cm [6]. A 3.1 m long open-cell drift chamber, the Central 
Outer Tracker (COT), covers the radial range from 40 to 137 cm [7j. The fiducial region of 
the silicon detector extends to pseudorapidity \r)\ ~2, while the COT provides coverage for 

Segmented electromagnetic and hadronic sampling calorimeters surround the tracking 
system and measure the energy flow of interacting particles in the pseudorapidity range 

to identify electron candidates 
for lepton candidates with 
10H located outside the central 



1 77 1 < 3.6. This analysis uses the new end plug detectors 
with 1.2 < \r]\ < 2.0 in addition to the central detectors 
\r]\ < 1.1. A set of drift chambers and scintillation counters 
hadron calorimeters and another set behind a 60 cm iron shield detect muon candidates with 
I ^7 1 <0.6. Additional chambers and counters detect muons in the region 0.6 < \t]\ < 1.0. Gas 
Cherenkov counters Q| located in the 3.7 < \r)\ <4.7 region measure the average number of 



inelastic pp collisions per bunch crossing and thereby determine the beam luminosity. 

The signature of ti decays in the dilepton channel is two jets from the b quarks, two high- 
momentum leptons and large missing energy (due to the unobserved neutrinos) from the 
W decays, and the possibility of extra jets from initial or final-state radiation. The major 
backgrounds for dilepton ti events are from Drell-Yan dilepton production (qq — > Z/j* — > 
e + e~ , ,t + t~), W(—> tv) + jets events where a jet "fakes" the signature of the second 
lepton, and diboson production (WW, WZ, ZZ). 

The data are derived from inclusive lepton triggers demanding central electrons with 
transverse energy Et = EsinO > 18 GeV, or central muons with transverse momentum 
p T = psin9 > 18 GeV/c. Electrons in the end plug are required to have Et > 20 GeV. 



in 



FIG. 1: Elevation view of the CDF II detector, showing the inner silicon microstrip detector, 
Central Outer Tracker drift chamber, electromagnetic and hadronic calorimeters, and muon drift 
chambers and scintillation counters. 

Events must also have a missing transverse energy ^ > 15 GeV, calculated from the vector 
sum — Y2i EtpUi, where rfj is the unit vector in the azimuthal plane which points from the 
beam line to the i th calorimeter tower. 

The top quark mass analyses described here employ one of two sets of selection criteria 
developed for the tt cross section measurement in the dilepton decay channel . The first 
method, referred to as the "dilepton" (DIL) analysis, is similar to that used in the CDF 
Run I measurement , and requires both candidate leptons to be specifically identified as 
either electrons or muons. A second "lepton+track" (LTRK) method increases the efficiency 
of the event selection (at the cost of a larger background) by requiring one well-identified 
lepton (electron or muon) in conjunction with an isolated track with large transverse mo- 
mentum. This method recovers events where leptons fall in calorimeter or muon detector 
gaps, and increases the acceptance for single prong hadronic decays of the r lepton from 
W — ► tv (approximately 20% of the total LTRK acceptance, compared with 12% for the 
DIL selection). 

Both selection methods demand a "tight" lepton in combination with a "loose" lepton of 
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opposite charge. Requirements for the tight lepton are identical for both methods, but differ 
for the loose lepton. Tight leptons must have well-measured tracks, based on the numbers 
of silicon and drift chamber hits and reconstructed vertex position, and have > 20 GeV. 
Tight leptons must also satisfy the isolation requirement that the total calorimeter within 
a cone AR = a/ Arf + A0 2 = 0.4 about the lepton trajectory not exceed 10% of the lepton's 
Et- Tight electrons must have lateral and longitudinal electromagnetic shower profiles in 
the calorimeter consistent with electrons, while tight muons must point to muon chamber 
hits and have a calorimeter signature compatible with minimum-ionizing particles. For 
the DIL method, loose leptons must be well-identified electrons or muons with no isolation 
requirement. Loose DIL electrons must be central, while the muon chamber hit requirements 
for loose DIL muons are relaxed. Loose leptons in the LTRK method, in contrast, are simply 
required to be well- measured and isolated tracks within \q\ < 1 and having pt > 20 GeV/c. 
The LTRK loose lepton isolation is determined from the pt sum of neighboring tracks within 
the cone AR = 0.4 about the lepton track, which must not exceed 10% that of the lepton. 

At least two jets are required per event, and are derived from looking for clusters of energy 
in calorimeter towers within a cone size of AR = 0.4. This total jet Et is corrected for non- 
uniformities in the response of the calorimeter as a function of ru effects from multiple pp 
collisions, and the hadronic jet energy scale of the calorimeter [l4j. Jets are required to 
have \rj\ < 2.5 and Et > 15 GeV for the DIL analysis, or < 2.0 and Et > 20 GeV for 
the LTRK method. The two highest Et jets for each event are assumed to stem from the b 
quarks; this assumption is true for ~70% of simulated tt events. For application in the top 
quark mass measurements, these jets are further corrected for energy deposited from the 
underlying pp event or lost outside the search cone AR = 0.4. The momentum components 
of each b quark are then calculated from the measured jet Et and angle by assuming a 
b quark mass of 5.0 GeV/c 2 . No explicit identification of b jets is used. 

The final signature of a dilepton tt event is missing transverse energy Et in the calorime- 
ter. For calorimeter tower clusters associated with an identified jet, the Et vector sum uses 
the transverse jet energy which has been corrected for calorimeter response and multiple 
pp collision effects. The Et for events with an identified muon is further corrected by the 
measured muon momentum. Dilepton ti events must satisfy the requirement Et > 25 GeV. 
False Et may arise through mismeasurement of the leptons or jets. Therefore, both DIL and 
LTRK methods require a minimum angular separation A<j) between lepton or jet trajectories 
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and the Et vector. For the DIL selection, events must have A0 > 20° for all leptons and 
jets if Et < 50 GeV. In the LTRK method, the Et vector cannot be within 5° of either 
the tight lepton direction or the axis of the loose lepton, and jets must have A0 > 25° for 
events with Et < 50 GeV. 

The dominant source of background for both selection methods is from Drell-Yan (qq — > 
Zj^i* — > ee,/i/x) events. These events should have no real Et , and can only satisfy the 
selection criteria if there is mismeasurement of the lepton or jet Et- Therefore, additional 
selection requirements are imposed for events where the reconstructed invariant mass of the 
two lepton candidates lies within 15 GeV/c 2 of the Z boson resonance. For these events, the 
DIL method requires a "jet significance" of > 8, defined as the ratio of Et to the sum of jet 
Et projected along the Et direction. The LTRK method increases the Et requirement to 
Et > 40 GeV for dilepton events near the Z resonance. The DIL method further suppresses 
background processes by requiring that the scalar sum of jet E T , lepton p T , and E T (denoted 
by H T ) exceed 200 GeV. 

Table IJ summarizes the luminosity and expected numbers of signal and background events 
for the DIL and LTRK selection methods, along with observed results from the inclusive 
lepton data set. The LTRK selection comprises a 6% greater luminosity since it is able to 
accept e/i dilepton decays when muon detectors were not operational. The acceptance and 
efficiency of ti signal events are calculated with a full detector simulation using PYTHIA ^| 
Monte Carlo and assuming a production cross section of 6.1 pb, corresponding to a top 
mass of 178 GeV/c 2 The Drell-Yan, W(—> tv) + jets fakes, and diboson background 

acceptances are estimated using a combination of Drell-Yan and iy+jets data, and PYTHIA 
and ALPGEN+HERWIG jl?], [3| simulation. The total uncertainties for expected event yields 
include both the statistical uncertainties of the Monte Carlo samples used, as well as sys- 
tematic uncertainties from particle identification, jet energy measurement, and modeling of 
the ti signal and background. Applied to the inclusive lepton data set, the DIL selection 
observes 33 events, and the LTRK selection observes 46 events, representing upward fluc- 
tuations for both selections from the predicted numbers of events at the assumed ti cross 
section. The DIL and LTRK data samples share 24 events in common, leading to a union 
of 55 events with a 44% overlap. 
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TABLE I: Luminosity, expected tt signal and background (with total uncertainties), and observed 
number of events for the DIL and LTRK selection methods. A tt cross section of 6.1 pb is assumed, 
corresponding to a top mass of 178 GeV/c 2 . 





DIL 


LTRK 


Luminosity 


340 pb -1 


360 pb -1 


Expected tt 


15.7 ± 1.3 


19.4 ± 1.4 


Drell-Yan 


5.5 ± 1.2 


8.7 ±3.3 


W(^£v)+jets fakes 


3.5 ± 1.4 


4.0 ± 1.2 


Diboson 


1.6 ±0.3 


2.0 ±0.4 


Total background 


10.5 ± 1.9 


14.7 ±3.6 


Total expected 


26.2 ±2.3 


34.1 ±3.9 


Observed 


33 


46 
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III. METHODS FOR TOP MASS MEASUREMENT 



Reconstruction of the top quark mass from dilepton events involves an underconstrained 
system. For lepton+jets decays, the two components of J£t generated by the single neutrino, 
along with other assumptions about the ti event (e.g., equal masses for the t and i quarks, 
and invariant masses of the Iv and qq' systems equal to the W mass) are enough to allow 
a kinematically overconstrained fit. For dilepton ti events, in contrast, the measured J£t is 
due to two neutrinos, so that the decay assumptions are insufficient to constrain the event. 

Specifically, for each ti event, the kinematics are fully specified by 24 quantities: the 
four-momenta of the six final state particles. Twelve three-momentum components of the 
two 6-quarks and two leptons are measured by the detector, along with the two components 
of • The four mass values of the final state 6-quarks and leptons are known, while the 
two neutrinos are assumed to be massless. Making three additional assumptions about the 
ti and W boson decays: 

mibtu) = m{btv) (1) 
m(tv) = m(W + ) (2) 
m(£-p) = m(W~) (3) 

results in only 23 measured, known, or assumed components of the system. Therefore, the 
top quark mass cannot be directly reconstructed from ti dilepton decays, but requires one 
additional kinematic assumption to constrain the system. 

In practice, for each event we integrate over undetermined kinematical variables to obtain 
distributions giving the relative likelihood of different values of the top quark mass. The 
three mass analyses are distinguished by different choices of kinematical variable, different 
methods for determining the likelihood of each top quark mass, and different approaches to 
distilling the resulting information into one top quark mass per event. This section describes 
each technique in turn. We model the ti decay kinematics and optimize each method over 
a large range of top quark masses, using HERWIG Monte Carlo simulation with CTEQ5L 
parton distribution functions. Potential biases in the reconstructed top quark masses are 
taken into account in the comparison of the measured distributions with top quark mass 
templates derived using the same simulation, as discussed in Section ITVl 
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A. Neutrino Weighting Algorithm (NWA) 



One method for estimating the top quark mass from dilepton events uses the Neutrino 
Weighting Algorithm (NWA). In Run I at the Tevatron, the NWA method was one of two 



techniques used by D0 [2Cj, and was employed by CDF 21 1 to measure the top quark 
mass. The method therefore provides a baseline for CDF Run II measurements, and is 
applied to the 360 pb~ x LTRK event sample. The strategy of the algorithm is to solve 
for the neutrino and antineutrino momenta, independently of the measured missing energy, 
by making additional assumptions about the ti decay. The neutrino/antineutrino solutions 
are then compared with the measured $t through a weight function in order to create a 
probability distribution for the event as a function of top quark mass. 

The NWA weight function is constructed as follows. We assume values for the top quark 
and W boson masses, the pseudorapidities of the neutrino and antineutrino, and the lepton- 
jet pairings associated with the top/antitop decays. We apply energy- momentum conserva- 
tion to the top quark decay and obtain up to two possible solutions for the 4-momentum 
(u) of the neutrino. We repeat this procedure on the antitop decay, resulting in up to 
four possible pairs of neutrino-antineutrino solutions (u,u). Each of the four solutions is 
assigned a probability (weight, Wi) that it describes the observed missing transverse energy 
components fi x and $ y within their uncertainties a x and cr y , respectively: 

Wi = eM~- ^ — )-exp( r-^ *—). (4) 

x y 

We use a x = o y = 15 GeV, which is obtained from ti Monte Carlo simulation generated with 
rrit = 178 GeV/c 2 . In practice, however, the performance of the algorithm is insensitive to 
the particular choice of J£t resolution. 

Given the assumed top quark mass and assumed neutrino rj values, any of the four solution 
pairs (u, u) have a priori equal probability. We therefore sum the four weights: 

4 

w(m t , r) u , r]p, i-jet) = 2j w%- (5) 

i=i 

Not knowing which are the true neutrino 77's in our event, we repeat the above steps for 
many possible (rj u , f] p ) pairs. As seen in the upper plots of Fig. El Monte Carlo ti simulation 
indicates that the neutrino 77's are uncorrelated, and follow a Gaussian distribution centered 
at zero with a width near one. Since the neutrino rj width varies little with top quark mass 
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(as shown in the lower plot of Fig. |2J), we assume a constant width for all top quark masses 
corresponding to the value of 0.988 obtained from the mt = 178 GeV/c 2 sample. To ensure 
symmetry and smoothness, we scan the neutrino r) distributions from —3 to +3 in steps of 
0.1, and each (r] u ,r] p ) pair is assigned a probability of occurrence P(i] u ,7] p ) derived from a 
Gaussian of width 0.988. Each trial {j] v - l r\y) pair contributes to the event according to its 
weight (Eq. EJ) and probability of occurrence, P{j] Vl rj D ): 

w(m t , E-jet) = ^2 Vp) ■ w ( m t, rj v , r] p , E—jet). (6) 

Vis, VP 

Since we do not distinguish b jets from b jets, both possible lepton-jet pairings are summed. 
Thus, the final weight becomes a function only of the top quark mass, after integrating over 
all other unknowns: 

f+-jet 2 

W(m t ) = w(m t , E-jet). (7) 

£+-jet 1 

We scan m t from 80 to 380 GeV/c 2 in steps of 1 GeV/c 2 . Figure 01 shows the resulting 
normalized weight distribution from Eq. [7| after applying the NWA method to a HERWIG 
Monte Carlo ti event, with a simulated top quark mass of 170 GeV/c 2 . We choose one 
indicative top quark mass for each event, selecting the most probable value (MPV) of the 
weight distribution as that which best explains the event as a ti dilepton decay. 

For a given event, there exists a small probability that the kinematics of the decay will fail 
to produce a solution for any scanned top quark mass. This efficiency for finding a solution 
is thus an additional event selection criterion. Studies of simulated ti dilepton events show 
that this NWA efficiency for signal is 99.8%, and independent of generated top quark mass. 
Applying the NWA method to Monte Carlo background samples shows that the efficiency 
for finding a kinematical solution varies between sources, ranging from 94-100%, with an 
average background efficiency of 96%. 

B. Full Kinematic Analysis (KIN) 

A second method for determining the top quark mass in the dilepton channel, called 
the Full Kinematic Analysis (KIN), is applied to the 340 pb _1 DIL selection sample. The 
KIN method resolves the underconstrained dilepton ti decays by introducing an additional 
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FIG. 2: Neutrino r/ distribution with Gaussian fit (upper left) and neutrino vs. antineutrino r/ 
(upper right) from a HERWIG ti sample with nit = 178 GeV/c 2 . Lower plot shows r\ width as a 
function of generated top quark mass, compared with fit value at m< = 178 GeV/c 2 (horizontal 
line). 



equation for the longitudinal momentum of the ti system, p^. With the 6-particle final 
state constrained, the KIN method solves the resulting kinematic equations numerically to 
determine the top quark mass for each event. 

Ideally, the quantity p** should be determined theoretically, and should be virtually in- 
dependent of the top quark mass. Studies from Monte Carlo simulation over a range of top 
quark masses from 140-200 GeV/c 2 show that has a Gaussian behavior, with a mean of 
zero and a width near 180 GeV/c. This width increases by roughly 10% across the top quark 
mass range studied. The validity of our Monte Carlo simulation can be tested with data 
from lepton+jets ti events, where p** can be reconstructed explicitly. Figure |U compares p** 
from the lepton+jets data sample with ti and background Monte Carlo samples, showing 
good agreement between data and simulation. The lepton+jets event selection, using sec- 
ondary vertex 6-quark identification, and subsequent backgrounds are similar to those of the 
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FIG. 3: NWA weight distribution as a function of top quark mass hypothesis (from Eq. [7J for a 
HERWIG Monte Carlo ti event with mt = 170 GeV/c 2 . The vertical line denotes the most probable 
value (MPV) of mt chosen by the method. 

lepton+jets cross section measurement (2^. 

Using the measured momenta of the 6-quarks and leptons, the two components of the 
measured $t > and assumptions about the six final-state particle masses, the additional 
constraint on p**, along with constraints on the W and ti decays, lead to the following set 
of kinematic equations: 



V 1 V 

Px + Px 


= fix 


Py+Py 


= fty 


pi + pi 


= ± 180 GeV/c 


m t 


= m 


m w ± 


= 80.4 GeV/c 2 


Pb + Pw+ 


= Pt 


Pb + Pw- 


= Pi 


Pl+ + Pv 


= Pw+ 


Pi- + Pv 


= pw-- 



These equations have two solutions, which are determined through an iterative procedure. 
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FIG. 4: Comparison of it longitudinal momentum between lepton+jets data (with associated 
Gaussian fit) and standard model processes. The data corresponds to a sample of 318 pb -1 . The 
tt signal uses PYTHIA Monte Carlo with with rat = 178 GeV/c 2 , corresponding to a cross section 
of 8.7 pb. 



If solutions cannot be found by using the above assumptions for the top and bottom quark 
masses, these requirements are relaxed, and we accept solutions where = 80.4 ± 

3.0 GeV/c 2 and m t = mi ± 2.0 GeV/c 2 . If no solutions are found after relaxing the mass 
requirements, the event is rejected. 

The iterative procedure employed, Newton's Method j^J, solves equations of the form 
f(x)=0. The method requires an initial guess for x which is reasonably close to the true 
root. The local derivative f'(x) is then computed and extrapolated to zero, providing a 
better approximation for the root. This procedure is repeated according to: 

x n+l =x n_ (Q) 

until a satisfactory solution is found. The method is extended to a system of k equations 
F(x) = fi(x) by determining the kxk Jacobian matrix J l ^{x) = , where (z = l,k;j = 
1, k). In actuality, the method solves the linear equations: 

J F {F) ■ -x n ) = -F^) (10) 

for the unknown x" 1 ^ — x™, in order to avoid having to compute the inverse of Jp(x n ). 

on 



Applying Newton's Method to the tt decay system of Eq. |HJ we determine the first of 
two pairs of quadratic solutions for the neutrino momentum according to the following set 



of three equations: 

fM\Py\p?) = (E h + E vi f - (p h +p Ul f -m 2 w = (11) 

f2(p v x \Vy\V?) = (E h + E V2 f - (p l2 +p V2 f -m 2 w = (12) 

f 3 (p?, P:\P?) = (E h + E U1 + E bl ) 2 - (p h +p Vl +p bl ) 2 (13) 



- {E h + E V2 + E b2 ) 2 + {p h + p V2 + p h2 ) 2 = 

from which the full kinematic chain is reconstructed, and the top quark mass solutions are 
calculated. The second quadratic solution for neutrino momentum pj = p Vx + X satisfies 
the following set of equations: 

f 1 (x 1 ,x 2 ,x 3 ) = ^m 2 w + (p Wi+ xy-E h -^J(p vi+ xy = (14) 
f 2 (x u x 2 , x 3 ) = \/m 2 w + {p W2 - X) 2 -E h -\/ {p U2 - X) 2 = (15) 



f 3 (xi,x 2 ,x 3 ) = 




- \j{^w + (pw 2 - xy + E b2 y - (? t2 - xy = o 

from which a second pair of top quark mass solutions is found. Since there are two possible 
combinations of 6-quark jets and leptons, we have a total of eight possible solutions for the 
top quark mass. 

In order to incorporate the large range of possible p l l values about the mean of zero 
(as seen in Fig. 0]), as well as the finite resolutions of the measured momenta and E? , the 
above procedure is repeated 10,000 times for each possible solution. For each repetition, the 
value of pf is drawn from a Gaussian distribution with zero mean and width of 180 GeV/c. 
The jet energies and Et are similarly smeared by Gaussians according to their estimated 
resolutions, while the relatively better resolutions on the measured jet angles and lepton 
momenta are assumed to be perfectly measured. Kinematic reconstruction of the smeared 
events results in a distribution of possible top quark masses for a given event (consistent with 
the measured kinematic characteristics of the event and the measurement uncertainties). 
The most probable value (MPV) of a spline fit to this mass distribution is then taken as the 
"raw top quark mass" for a given solution. 
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FIG. 5: Smeared top quark mass distributions of the "favored" and "disfavored" lepton-jet pairings 
from the KIN method applied to a HERWIG Monte Carlo ti event with m t = 170 GeV/c 2 . Also 
shown for the "favored" pair is the spline fit used to select the top quark mass for a given solution. 

The KIN method then selects a single "raw top quark mass" from the eight possible 
solutions as follows. Of the four possible solutions for each lepton-jet pairing, we choose that 
with the smallest effective mass of the ti system. Based on simulated events at m t = 178 
GeV/c 2 , this particular mass solution is closest to the generator-level top quark mass for 
approximately 84% of the events. The smeared mass distributions of the remaining two 
possible solutions (due to the two lepton-jet pairings) are then compared, as shown in Fig.|3] 
for an example simulation event. We choose the lepton-jet pair which produces the largest 
number of entries (i.e. fewest number of rejections) in the smeared distribution. The mass 
solution from this kinematically "favored" pair is found to be closest to the generated top 
quark mass for about 70% of events. In this manner, the KIN method returns a single top 
quark mass for each ti dilepton event. Although this method necessarily has a bias towards 
lower top masses, this bias is fully included in the simulation used to extract the final top 
mass value. 
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C. Neutrino cj> Weighting Method (PHI) 



A third procedure for analyzing it dilepton decays, referred to as the Neutrino Weighting 
Method (PHI), most closely resembles the Run I lepton+jets template analysis |2l|. Intro- 
ducing additional assumptions about the azimuthal angle <fi of the final-state neutrinos, this 
method reconstructs dilepton decays through the minimization of a chi-square functional 
(x 2 ) to arrive at a single top quark mass for each event. As with the KIN analysis, the PHI 
method uses the 340 pb _1 DIL selection sample. 

The x 2 functional to be minimized takes the form: 



2 _ ^ 04 ~ P 1 t? , \- (P J T ~ p't? , f (UE l - UE-j n7 , 

£=1 U PT j=l a VT i=l U UE 



(m tlVl - m w ) 2 {m l2U2 - m w f (m jltlVl - m t f (m hl2V2 - m t ) 2 

p2 r 2 p2 ~"~ p2 

1 1 1 t 1 t 

The first term sums over the primary lepton transverse momenta p^, with detector resolu- 
tions for the electrons and muons taken to be P|: 



vL 0.135 



PT 

e 



2 



Pt 
a." 




+ 0.02 2 (18) 



-p- = 0.0011 -p*. (19) 

The second x 2 term sums over the transverse momenta p 3 T of the two leading jets. These 
transverse momenta have been further corrected for underlying event and out-of-cone en- 
ergy, and have a px and ^-dependent detector resolution a 3 pr derived from simulation. The 
quantity UE (with uncertainty (Jue) m the third x 2 term denotes the unclustered energy 
in the calorimeter, summed over (i — 1, N) towers, which is not associated with a lepton 
or leading jet calorimeter cluster, but includes any additional jets with Et > 8 GeV/c 2 and 
\r]\ < 2.5. The quantities m&, and rrij£ U in Eq. El r efer to the reconstructed invariant masses 
of the W boson and top quark decay products, respectively. For the W boson decay width 
we use the P.D.G. value Ty/ = 2.1 GeV/c 2 24], while for the top quark we assume a width of 
F t = 2.5 GeV/c 2 . Variables with a tilde refer to the output of the minimization procedure. 
The quantity m t is the fit parameter returned as the reconstructed top quark mass for the 
combination being considered. 

To resolve the neutrino momentum used in the W and top decay constraints of Eq. IT71 
two additional assumptions are needed. Assuming values for both neutrino azimuthal angles 
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*>vi, <Pu2), the transverse momenta of the neutrinos are linked through the measured Pt by: 



Vt ■ cos(M + Vt ■ cos(0, 2 ) = p x (20) 



Pt ■ sin^i) + • sin(4> u2 ) = P> 



leading to the solutions: 
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= Pt 


• sin(< 
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= Pt 2 - 


cos(< 
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= P? 


• sin(. 
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■ sin (<f) u2 
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^ ■ cos(i 
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(21) 



^. 



■ sin (0^2 J 



sin(0^i - y2 ) 

Performing the x 2 minimization of Eq. El on all allowed values of neutrino creates a set of 
solutions in the (0„i, 4>v2) plane. In practice, only points in the quadrant (0 < 4>vi < 7r, < 
0i/2 < tt) need to be sampled, since identical neutrino momentum components from Eq. I2T1 
occur for the four points (0^,0^), (0^i + ^,0^), (0i/i,0i/2 + 7r), and (0^ + 7T, 0„ 2 + tt)- 
Since p^ 1 '^ 2 must be positive by definition, and will only change sign by adding it to <p v i,u2, 
only one of the four points represents a physical solution. Solutions from other points are 
unphysical and can be interpreted as "mirror reflections" of the physical solution. 

A grid of 12 x 12 points in the (<p u i, 0^2) plane is chosen, in a manner which avoids points 
where sin(0 yl — 0^) = and Eq. |2U becomes undefined. At each point, 8 solutions exist due 
to the two-fold ambiguity in longitudinal momentum for each neutrino, and the two possible 
lepton-jet combinations. Thus, for each event, 1152 minimizations of Eq. El are performed, 
each returning an output y 2 and reconstructed top quark mass m rec . The minimal value for 
X 2 among the 8 possible solutions at each point is retained, reducing each event to an array 
of 144 Xij an d fn]j C values, where (i = 1, 12; j = 1,12) refer to the (0^1, 0^2 ) grid points. 
Each point is weighted by its returned \ 2 value according to: 

_ exp(- X 2 -/2) 

W *"2^E£x«p(-xS/2) [ } 

to create a probability density distribution normalized to unity. 

To arrive at a single top quark mass value per event, the reconstructed mass values m-| c 
of the array are averaged, using the weights derived from Eq. |22l The sensitivity to the top 
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FIG. 6: Binned weight distribution from the PHI method for a HERWIG Monte Carlo tt event with 
mt = 170 GeV/c 2 , showing the resulting average mass for bins above the 30% discrimination level 
(DL). 



quark mass is enhanced by averaging only points with a weight at least 30% that of the 
most probable value in the probability density distribution. Figure El shows the results of 
the PHI method applied to a HERWIG Monte Carlo tt event with m t = 170 GeV/c 2 . 
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IV. TEMPLATE LIKELIHOOD PROCEDURE 

The three independent measurement techniques described in Section UTTl produce a single 
top quark mass for each event in their corresponding data samples, which are mixtures 
of ti signal and background events. To arrive at a final top quark mass measurement, 
these data events are compared with probability density functions (p.d.f.'s) for signal and 
background within a likelihood minimization. The p.d.f.'s are developed from template 
mass distributions created by applying the NWA, KIN, and PHI methods to simulated it 
signal and background samples, which are then parameterized. For the NWA and PHI 
methods, this parameterization uses a combination of Gaussian and gamma distribution 
terms. Similarly, the KIN method parameterization contains a Gaussian term in conjunction 
with an approximate Landau distribution. 

A. Template construction 

For the signal, we use ti dilepton events generated with HERWIG Monte Carlo simulation 
for top quark masses from 130 to 230 GeV/e* in 5 GeV/c* increments. The CTEQ5L Q 
Structure Functions are used to model the momentum distribution of the initial state par- 
tons. For the NWA and PHI methods, the signal templates obtained from this simulation 
are parameterized as the sum of a Gaussian and a gamma distribution. This parameteri- 
zation gives the signal p.d.f, P s (m; m t ), representing the probability of reconstructing a top 
quark mass m when the true mass is m t : 



The six parameters aij in Eq. EH] are assumed to be linearly dependent on the generated 
top quark mass, such that we in fact perform a 12-parameter fit for pi on all templates 
simultaneously, with: 



Figures [7| and |H1 show representative signal templates from the NWA and PHI methods, with 
the corresponding parameterized fitting function. 
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FIG. 7: Example signal templates as a function of the reconstructed top quark mass, from the 
NWA method applied to simulated signal samples at top quark masses of 130, 160, 190 and 220 
GeV/c 2 . Overlaid are the parameterized fitting functions using Eg. 1231 The vertical line indicates 
the generated top quark mass. 



The signal template parameterization employed by the KIN method contains the Gaus- 
sian and Landau-like terms: 

m - a 4 (m t ) 

^Gauss — : s I/O ) 

a 5 {m t ) 
m-anjmt) 

^Landau — 7 s {*V) 

a 2 {m t ) 

which form the probability density function: 

P s {m;m t ) = exp(-0.5(A Landau + exp(-A Lan dau))) (27) 

+ (1 -'? (m ' )) exp(-0.5AL m ,) 

for reconstructing a top quark mass m given a true mass mj. The Gaussian and Landau terms 
are normalized for solutions within the reconstructed mass range 90 < m < 300 GeV/c 2 by 
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FIG. 8: Example signal templates as a function of the reconstructed top quark mass, from the 
PHI method applied to simulated signal samples at top quark masses of 140, 160, 190 and 220 
GeV/c 2 . Overlaid are the parameterized fitting functions using Eg. 1231 The vertical line indicates 
the generated top quark mass. 



the integrals: 

/■300 

h = / e-^^^^Mm, (28) 

,•300 

J 2 = / e"^ A G— s dm. (29) 

The parameters a, (i = 1,5) are simultaneously fit to all templates by assuming a linear 
dependence on the true top quark mass m t : 

&i{m t ) = cii + bi* m t . (30) 

Example signal templates using the KIN method parameterization are shown in Fig. El 
We observe that, for all template methods, the mean of the signal template lies above the 
generated top quark mass for the m t = 140 GeV/c 2 sample, but moves below the generated 
value for higher mass samples. 

For the background events, we create one representative background template by adding 
the individual templates from each background source according to their expected yields from 
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FIG. 9: Example signal templates as a function of the reconstructed top quark mass, from the 
KIN method applied to simulated signal samples at top quark masses of 130, 160, 190 and 220 
GeV/c 2 . Overlaid are the parameterized fitting functions using Eg. 1271 The vertical line indicates 
the generated top quark mass. 



Table |l] The templates from the various background processes are reconstructed from fully 
simulated Monte Carlo samples: the Drell-Yan events from PYTHIA, the W(— »£z/)+jets fakes 
from ALPGEN+HERWIG simulation of W(— > ev) + 3 partons, and the diboson from PYTHIA 
and ALPGEN+HERWIG. In combining these sources for each mass measurement technique, 
the measured efficiencies for finding a mass solution for each simulated background source 
are taken into account. We obtain the background p.d.f. (Pb(m)) by fitting the combined 
background template with a functional form identical to that used for the signal templates 
(Eq. [21 for the NWA and PHI methods, and Eq. EH for the KIN method), but with fitted 
parameters independent of true top quark mass m t . The resulting mass templates for the 
three background sources, along with the combined background template and parameterized 
fit, are plotted for each method in Figs. HOlfT^l 
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FIG. 10: Reconstructed top quark mass templates for the Drell-Yan, Diboson, and Fakes back- 
ground sources using the NWA method, along with the combined background template and as- 
sociated fitted probability density function. Background sources are normalized to the expected 
contribution in the 360 pb" 1 LTRK sample. 

B. Likelihood minimization 



The final step for each dilepton template analysis is the determination of a representative 
top quark mass from the data sample by performing a likelihood fit and minimization. The 
likelihood function finds the probability that our data are described by an admixture of 
background events and dilepton ti decays with a certain top quark mass. As input we use 
the top quark mass values returned by the particular mass measurement technique applied 
to the data sample, and the parameterized probability density functions of the signal and 
background templates derived from simulation. 

The total likelihood takes the form: 



C(m t ) = C shapc (m t ) x C 



n b 



where, 



e {ns+nb \n s + n b ) N n s P s (mi; m t ) + n b P b [ 



n 



rrii 



n s + n b 



(31) 
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FIG. 11: Reconstructed top quark mass templates for the Drell-Yan, Diboson, and Fakes back- 
ground sources using the KIN method, along with the combined background template and as- 
sociated fitted probability density function. Background sources are normalized to the expected 
contribution in the 340 pb _1 DIL sample. 



and 

-UC m= ( ^fl. (33) 

The likelihood returns a true top quark mass hypothesis (m t ), and estimated numbers of 
signal (n s ) and background events. We assign a probability that each event (?) looks 
like signal and a probability that it looks like background. The signal and background 
probabilities are assigned by comparing the measured top quark mass values rrii from the data 
with the parameterized signal and background p.d.f.'s P s and We find the probabilities 
that the likelihood-estimate for the number of background events n& is consistent with our 
a priori estimate % xp , and that the likelihood-estimate for the total number of signal (n s ) 
and background events is consistent with the observed number of events N. The number of 
background events is constrained with a Gaussian about n^ xp (of width equal to the expected 
background uncertainty o~ nb ), while the sum of n s and rib is constrained with a Poisson 
term. In this manner, the likelihood-estimated number of signal events is independent of 
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FIG. 12: Reconstructed top quark mass templates for the Drell-Yan, Diboson, and Fakes back- 
ground sources using the PHI method, along with the combined background template and as- 
sociated fitted probability density function. Background sources are normalized to the expected 
contribution in the 340 pb _1 DIL sample. 



the expected number of signal events based on an assumed ti cross section. The true top 
quark mass hypothesis (mj) which minimizes — ln(£) is retained. 

The statistical uncertainty on m t is given by the difference between the minimization 
mass result and the mass at — ln(£/£ max ) + 0.5. In the NWA and KIN analyses, uncer- 
tainty on the top quark mass from uncertainties in the signal and background template 
parameterizations (due to limited statistics of the simulated template samples) is estimated 
and included as a systematic uncertainty (see Section \Vll\i . The PHI analysis incorporates 
this parameterization uncertainty directly into the top quark mass statistical uncertainty 
through the addition of a third term to the likelihood function (Eq. I3*T]) : 

A™ = exp (-0.5{(a - a ) T U-\a - a ) + 0- fa) T V-\P - A,)}) (34) 

where U and V represent the covariance matrices of the signal and background parameters 
a and (3, respectively. 
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V. TESTING WITH PSEUDO-EXPERIMENTS 



We use a large number of simulated data ensembles, or pseudo-experiments, to check 
whether the methods for mass measurement described above return the expected top quark 
mass. For each generated top quark mass from 150 to 210 GeV/c 2 , we construct a set of 
pseudo-experiments. Each pseudo-experiment consists of masses drawn randomly from the 
signal and background mass templates (e.g., Figs.lTland lTm) . The numbers of signal and back- 
ground events in each pseudo-experiment are given by random Poisson fluctuations around 
the a priori estimates from the DIL and LTRK selections (see Table ITTj) . These estimates 
correspond to a tt cross section of 6.1 pb, and are adjusted for the reconstruction efficiency 
of each method for finding top quark mass solutions for signal and background events. The 
likelihood minimization procedure described in the previous section provides a "measured" 
top quark mass and statistical uncertainty for each pseudo-experiment. Figures fTSHTBI show 
the results from these pseudo-experiments for the NWA, KIN, and PHI methods, respec- 
tively. The upper plots show that the measured output top quark mass tracks the generated 
input mass. From the lower plots we observe that the residual differences between input and 
output top quark mass are consistent with zero for all methods, within uncertainties due to 
the limited statistics of the signal and background mass templates. 

In order to check the consistency between the spread in output top quark mass and 
the estimated positive (cr + ) and negative (<r_) statistical uncertainties from the pseudo- 

TABLE II: Expected signal and background events for the NWA, KIN, and PHI methods applied 
to the LTRK (360 pb -1 ) or DIL (340 pb" 1 ) selections, and corresponding to a tt cross section 
of 6.1 pb. Event numbers are adjusted for signal and background reconstruction efficiencies (in 
parentheses). Also shown is the a priori statistical uncertainty on top mass for each method using 
the mt = 178 GeV/c 2 simulation sample and correcting for underestimation found in pulls (by the 
scale in parentheses). 

method luminosity expected sig. expected bkg. expected er s tat 

NWA 360 pb- 1 19.4 ±1.4 (99.8%) 14.1 ±3.5 (96%) 12.8 GeV/c 2 (1.060) 

KIN 340 pb" 1 12.9 ±1.1 (75%) 6.4 ±1.2 (61%) 15.1 GeV/c 2 (1.033) 

PHI 340 pb" 1 17.2 ± 1.4 (100%) 10.5 ± 1.9 (100%) 14.5 GeV/c 2 (1.055) 
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FIG. 13: Results from pseudo-experiment tests of the NWA method. The upper plot shows the 
mean of the output (measured) top quark mass as a function of the input (generated) mass, while 
the lower plot gives the difference between output and input top quark mass as a function of the 
input mass. 
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FIG. 14: Results from pseudo-experiment tests of the KIN method. The upper plot shows the 
mean of the output (measured) top quark mass as a function of the input (generated) mass, while 
the lower plot gives the difference between output and input top quark mass as a function of the 
input mass. 
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FIG. 15: Results from pseudo-experiment tests of the PHI method. The upper plot shows the 
mean of the output (measured) top quark mass as a function of the input (generated) mass, while 
the lower plot gives the difference between output and input top quark mass as a function of the 
input mass. 



experiments, pull distributions are generated according to: 

m out n^in ; or \ 

PUU = K + a.)/2 (35) 

for each of the generated samples at different input mass (with examples shown in Fig. fTEj) . 
Figures ITTlfTTH summarize the pull mean and width for the NWA, KIN, and PHI methods 
as a function of generated top quark mass, with corresponding uncertainties due to mass 
template statistics. Non-unity widths of the pull distributions indicate that the statistical 
uncertainty is underestimated for the three analyses. Therefore, we scale the uncertainties 
obtained from the likelihood fit on the data by the underestimation determined from the 
pseudo-experiments. Using the m t = 178 GeV/c 2 HERWIG simulation and assuming a ti 
cross section of 6.1 pb, Table ITT1 compares the expected statistical uncertainty of the three 
measurement techniques after applying this correction due to observed pull width (shown 
in parentheses). 



3F> 




FIG. 16: Example pull distributions for the NWA (upper), KIN (middle), and PHI (lower) pseudo- 
experiments, corresponding to generated mass samples at 140, 170, and 200 GeV/c 2 . Each pull 
distribution is fit to a Gaussian (solid line), with noted standard deviation (a). 
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FIG. 17: Summary of pull distributions for the NWA pseudo-experiments, showing the pull mean 
(upper) and width (lower) as a function of generated input top quark mass, compared with zero 
mean and unity width (horizontal lines). 
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FIG. 18: Summary of pull distributions for the KIN pseudo-experiments, showing the pull mean 
(upper) and width (lower) as a function of generated input top quark mass, compared with zero 
mean and unity width (horizontal lines). 
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FIG. 19: Summary of pull distributions for the PHI pseudo-experiments, showing the pull mean 
(upper) and width (lower) as a function of generated input top quark mass, compared with zero 
mean and unity width (horizontal lines). 
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VI. RESULTS 



The NWA procedure is applied to the 46 events satisfying the LTRK selection in 360 pb _1 
of Run II data, with 45 events resulting in NWA solutions. The KIN and PHI analyses are 
applied to the 33 events of the DIL selection sample, corresponding to 340 pb _1 . Of this 
sample, 30 events pass kinematic reconstruction in the KIN method, while the PHI analysis 
returns solutions for all 33 events. Each method applies the likelihood procedure described 
in Sec. IIVI using the expected number of background events listed in Table ITT1 for the LTRK 
or DIL selection sample after accounting for mass reconstruction efficiency As listed in 
Table llll| each likelihood fit returns a constrained number of background events consistent 
with the expected value. The number of signal events returned from each likelihood fit 
ensures that the total number of events in the likelihood agrees with that observed, thereby 
accounting for the upward fluctuations in both the DIL and LTRK selection samples. 

The upper plots of Figs. show for the NWA, KIN, and PHI methods, respec- 

tively, the reconstructed top quark mass in the data, the normalized background and sig- 
nal+background shapes, and the variation of — ln(£/£ max ) as a function of the top quark 
mass hypothesis. For each method, the final top quark mass is taken as the value of m t 
which minimizes the likelihood function. Statistical uncertainties are obtained by taking the 
width at — ln(£ / £ max ) +0.5, and adjusting for the underestimation found in pull widths from 
Figs. fTTlfTTfl Table IIHI summarizes the measured top quark mass and statistical uncertainty 

TABLE III: Summary of results for the NWA, KIN, and PHI methods applied to the LTRK 
and DIL data samples. Listed for each method are: number of total observed events in the 
sample, number of events with mass solutions, expected number of background events, constrained 
likelihood fit values for signal and background events and top quark mass, and unconstrained 
likelihood mass. 
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FIG. 20: Results for the NWA method applied to the 46-event LTRK data sample, leading to 
45 solutions. Upper plot: reconstructed top quark mass for the data events (histogram), with 
normalized background and signal+background p.d.f. curves, and the likelihood function (inset). 
Lower plot: comparison of measured positive and negative statistical uncertainties in the data 
sample (vertical lines) with pseudo-experiments generated using the 170 GeV/c 2 signal template 
and assuming at least 45 events observed. 



for the three mass methods after pull width corrections. 

The lower plots of Figs. I20ll2*2*l compare the measured statistical uncertainties of the NWA, 
KIN, and PHI methods with pseudo-experiments using the m t = 170 GeV/c 2 sample which 
have the same number of events as that observed in the data for each method. We find the 
probabilities for achieving the observed statistical uncertainties to be 9%, 23%, and 19% 
for the NWA, KIN, and PHI methods, respectively. As a further cross-check, we remove 
the Gaussian constraint on the number of background events in the likelihood procedure 
(i.e., the term C nb in Eq. 13 For all three methods, this unconstrained fit converges near 
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FIG. 21: Results for the KIN method applied to the 33-event DIL data sample, leading to 30 
solutions. Upper plot: reconstructed top quark mass for the data events (histogram), with normal- 
ized background and signal+background p.d.f. curves, and the likelihood function (inset). Lower 
plot: comparison of measured average statistical uncertainty in the data sample (vertical line) with 
pseudo-experiments generated using the 170 GeV/c 2 signal template. 



zero background events, which is found to occur for 21% (NWA), 31% (KIN), and 20% 
(PHI) of pseudo-experiments using the m t = 170 GeV/c 2 sample. The resulting top quark 
mass of the unconstrained fit, corrected for pull width, is consistent with the constrained 
result for each mass method (as seen in Table ITTTj) . The statistical uncertainty returned 
by the unconstrained fit on the data sample is smaller than that of the constrained fit for 
all methods. However, from studies of pseudo-experiments at m t = 170 GeV/c 2 we expect 
on average an improvement in statistical uncertainty of 1.1, 1.5, and 1.2 GeV/c 2 for the 
NWA, KIN, and PHI methods, respectively, when applying the background constraint to 
the likelihood function. 
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FIG. 22: Results for the PHI method applied to the 33-event DIL data sample. Upper plot: 
reconstructed top quark mass for the data events (histogram), with normalized background and 
signal+background p.d.f. curves, and the likelihood function (inset). Lower plot: comparison of 
measured positive and negative statistical uncertainties in the data sample (vertical lines) with 
pseudo-experiments generated using the 170 GeV/c 2 signal template. 
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VII. SYSTEMATIC UNCERTAINTIES 



Apart from the statistical uncertainty on the measured top quark mass due to the limited 
size of our data sample, there are several sources of systematic uncertainty. These systematic 
effects stem from uncertainty in the Monte Carlo simulation of tt and background events, 
from mismodeling by the simulation of the detector response to leptons and jets, and from the 
validity of various assumptions made during the implementation of the mass measurement 
techniques. As such, most sources of systematic uncertainty are common to all three mass 
analyses, and are estimated by adjusting a particular input value to the simulation and 
constructing new mass templates. We then perform pseudo-experiments using events drawn 
from the new mass templates, and compare the resulting median reconstructed top quark 
mass with that of the nominal simulation. The sources of systematic uncertainty within 
each mass analysis are assumed to be uncorrelated, so that a total systematic uncertainty 
for each method is calculated as the sum in quadrature of the various sources, as summarized 
in Table HY1 

One of the largest sources of systematic uncertainty arises from potential mismodeling 
of the jet energy measurement, through uncertainties in the various corrections applied to 
the measured jet energy These jet energy corrections involve the non-uniformity in re- 
sponse of the calorimeter as a function of rj, effects from multiple pp collisions, the absolute 
jet energy scale for hadrons, energy deposition from the underlying pp event, and energy loss 
outside the jet search cone AR. A systematic uncertainty is estimated for each jet energy 
correction by performing pseudo-experiments drawn from signal and background templates 
with ±1 standard deviation in correction uncertainty, and taking the half-difference in me- 
dian reconstructed top quark mass between the two results. The uncertainties from each 
energy correction are then added in quadrature to arive at a total systematic uncertainty 
on the jet energy scale. 

Since the above jet energy corrections are developed from studies of samples dominated 
by light-quark and gluon jets, additional uncertainty occurs from extrapolating this pro- 
cedure to 6-quarks. The resulting systematic effect on jet energy is considered to stem 
from three main sources: uncertainty in the 6-jet fragmentation model, differences in the 
energy response due to semi-leptonic decays of 6-hadrons, and uncertainty in the color flow 
within top quark production and decay to 6-jets j^J. As in the jet energy scale uncertainty, 

J.3 



pseudo-experiments are performed on events where the 6-jet energies have been altered 
by ±1 standard deviation for each uncertainty, and the resulting half-differences added in 
quadrature to estimate the total systematic uncertainty due to 6-jet energy uncertainty. 

Several systematic uncertainties are due to the modeling of the tt signal. We study the 
effects of the particular Monte Carlo generator chosen by comparing pseudo-experiments 
drawn from PYTHIA simulation with events taken from our nominal signal templates con- 
structed using HERWIG. These generators differ in their hadronization models and in their 
handling of the underlying pp event and multiple pp interactions 2(| • We take the difference 
in reconstructed top quark mass between HERWIG and PYTHIA pseudo-experiments as the 
systematic uncertainty due to choice of generator. The systematic uncertainty associated 
with the initial state radiation (ISR) is studied by changing the QCD parameters for parton 
shower evolution according to comparisons between CDF Drell-Yan data and simulation j^j]. 
Since final state radiation (FSR) shares the same Monte Carlo shower algorithms as ISR, 
these variations in QCD parameters are used to generate FSR systematic samples by varying 
a set of parameters specific to FSR modeling. We then compare the reconstructed top quark 
mass from samples with increased and decreased ISR and FSR to estimate the systematic 
uncertainty due to these sources. The uncertainty in reconstructed top quark mass from 
our choice of parton distribution function (PDF) is found by comparing two different groups 
(CTEQ5L Q and MRST72 27|). Additionally, MRST72 and MRST75 sets, derived using dif- 
ferent Aqcd values, are compared, and 20 eigenvectors within the CTEQ6M group are varied 
by ±1 standard deviation. Differences in pseudo-experiment results from these variations 
are added in quadrature to arive at a total systematic uncertainty from the choice of PDF. 
Further studies comparing LO with NLO tt Monte Carlo show a negligible effect on the 
reconstructed top quark mass. 

Since our background template is also derived from simulation, another source of system- 
atic uncertainty reflects the potential mismodeling by the Monte Carlo of the background 
shape. Background events may pass event selection through processes which are not accu- 
rately modeled in the simulation, such as tracks or jets passing through gaps in detector 
elements. This uncertainty is estimated by measuring the resulting top quark mass from 
pseudo-experiments where the track p T , jet energy, and $t of each background sample 
have been altered by the measured discrepancies in these quantities between simulation and 
data. Another systematic uncertainty affecting the background shape is due to uncertainty 
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in the relative composition of the background sources used to construct the total back- 
ground template. To estimate this uncertainty, we measure the effect on top quark mass 
from pseudo-experiments where the relative combination of Drell-Yan and fake backgrounds 
(the largest two sources) is adjusted by predicted uncertainty 

The finite statistics in the simulated signal and background templates result in a sys- 
tematic uncertainty on the parameterized p.d.f's used in the likelihood (Eq. E2J), even if 
modeling of the signal and background processes is correct. As described in Section ITVl the 
PHI method accounts for this uncertainty in template parameterizations within the statisti- 
cal uncertainty returned by the likelihood minimization through the term £ para m of Eq. H2] 
The NWA and KIN procedures estimate directly the top quark mass uncertainty due to 
finite template statistics, and incorporate this effect into the total systematic uncertainty. 
For each signal template, we Poisson fluctuate the number of events in each bin to create 
a new template, which is parameterized according to Eqs. EH1 and EH We then perform 
pseudo-experiments drawing signal events from the nominal templates but applying them 
to a likelihood fit with the fluctuated signal p.d.f. in Eq. E21 producing a distribution of 
reconstructed top quark mass. Repeating this procedure many times, we estimate the sys- 
tematic uncertainty due to limited statistics in the signal templates as the root mean square 
of the median reconstructed top quark masses from the fluctuated pseudo-experiments. In a 
similar fashion, we estimate the analogous systematic uncertainty due to limited background 
template statistics by fluctuating each template bin of the various background components. 
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TABLE IV: Summary of the systematic uncertainties on the top quark mass measurement (in 
GeV/c 2 ) for the NWA, KIN, and PHI analyses. The total uncertainty is obtained by adding the 
individual contributions in quadrature. (The uncertainty due to signal and background template 
statistics for the PHI method is accounted for in the total statistical uncertainty). 



systematic source NWA KIN PHI 



Jet energy scale 


3.4 


3.2 


3.5 


6-jet energy 


0.6 


0.6 


0.7 


MC generator 


0.5 


0.6 


0.7 


PDF's 


0.5 


0.5 


0.6 


ISR 


0.6 


0.6 


0.6 


FSR 


0.5 


0.3 


0.4 


Background shape 


2.6 


1.6 


1.5 


Template statistics 








Signal 


0.2 


0.4 


n/a 


Background 


1.3 


1.2 


n/a 


Total 


4.6 


4.0 


4.0 



VIII. COMBINATION OF MEASUREMENTS 



Table IS shows the results, including statistical and systematic uncertainties, for the 
NWA, KIN, and PHI analyses. The three results are consistent, and can be combined to 
improve upon the overall precision of the top quark mass measurements using the template 
method. The combination procedure follows the Best Linear Unbiased Estimation (BLUE) 
method 



28| . In this technique, the final result consists of a linear combination of the 
individual measurements. The measured statistical and systematic uncertainties for each 
measurement, along with their correlations, are used to construct an error matrix, which 
upon inversion gives the corresponding weights for each method within the combined result. 

The statistical correlations between methods are determined from simulated samples over 
a range of top quark masses from 155 to 195 GeV/c 2 . Pseudo-experiments from these sam- 
ples, each corresponding to an integrated luminosity of 350 pb _1 , are constructed. The LTRK 
and DIL selection criteria are applied to the pseudo-experiments to model the expected sig- 
nal sample, as well as correlations between the selection methods. Simulated background 
events are added to each pseudo-experiment according to the expected contributions to each 
selection from the three background sources (as listed in Table H}. Based on studies from 
simulation, the W(—> tv) +jets fake background is assumed to be uncorrelated between the 
DIL and LTRK selections. For the diboson and Drell-Yan DIL backgrounds, a Poisson fluc- 
tuated number of events with mean (Ndh,) is added to each pseudo-experiment, drawn from 
a pool of events passing the DIL selection. The LTRK diboson and Drell-Yan backgrounds 
are constructed by taking into account the expected number of common events, ^ltrk dil, 
between the two selections, as determined from simulation. These LTRK backgrounds are 
thus the union of Altrk-dil common events with a number of events (Altrk — Altrk-dil) 
drawn from the pool of those events passing only the LTRK selection. From these pseudo- 
experiments of signal and background events, we find an overlap between the LTRK and 
DIL selections of approximately 30%, compared with the 44% overlap observed in the two 
data samples. 

The three measurement methods are applied to each pseudo-experiment, returning a 
reconstructed top quark mass and expected statistical uncertainty. Pseudo-experiments 
where the likelihood minimization for any of the methods fails to converge are removed from 
consideration. This situation occurs for approximately 0.8% of all pseudo-experiments. Also, 
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pseudo-experiments must have a returned mass which lies within the likelihood minimization 
mass range of all methods. The correlation coefficient between two mass measurements rrii 
and rrij is calculated from the N pseudo-experiments at each mass sample by the equation: 

p = N E m i m i - E m i E m i ^ 

^ y/N E ™? - (E rm)*y/N E ™) - (E ™;) 2 ' 
These statistical correlations are shown in Table IVl for the m t = 170 GeV/c 2 sample, and 
are stable across the 155 to 195 GeV/c 2 mass samples. Systematic uncertainties common 
to all three methods, as listed in Table I1V[ are assumed to be 100% correlated, while the 
systematic uncertainties due to limited template statistics for the NWA and KIN methods 
are assumed to be uncorrelated. After incorporating systematic effects, Table |V| shows the 
total correlations between methods in parentheses, and the resulting weights of the methods 
after inversion of the constructed error matrix. 

Since measurements producing smaller mass values tend to have correspondingly smaller 
statistical uncertainties, an iterative combination procedure is performed in order to prevent 
bias towards lower mass values. In this procedure, each measurement method is assumed 
to have a constant fractional statistical uncertainty (taken from the expected uncertainty at 
m t = 178 GeV/c 2 in Table ITT]). Combining the three measurements, each method's statistical 
uncertainty is then extrapolated to the combination value, and the procedure repeated until 
the combined result converges. 

Potential bias in the combination technique is studied by using the pseudo-experiment 
results from each method as input. As shown in Fig. |2S1 we observe no bias in the residual 
difference between the input and output top quark mass of the combination result for samples 
above m t = 160 GeV/c 2 . However, non-unity pull widths for all mass samples indicate that 
the statistical uncertainty of the combination procedure is slightly underestimated. Studies 
of toy Monte Carlo pseudo-experiments suggest that this underestimation may stem from 
deviations in the data sample from assumptions made in the BLUE method (e.g., Gaussian 
uncertainties). Applying the combination procedure to the NWA, KIN, and PHI data sample 
measurements, and correcting the returned statistical uncertainty by the average pull width 
over all mass samples (a scale factor of 1.15), yields a top quark mass of 170.1 ±6.0 (stat.) ± 
4.1 (syst.) GeV/c 2 . The close proximity of the three template measurements with respect 
to their measured uncertainties leads to a x 2 P er degrees of freedom for the combination of 
X 2 /d.o.f. = 0.017/2, corresponding to a p-value of 99%. 
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FIG. 23: Summary of the difference between output and input top quark mass (upper) and width 
of pull distributions (lower) for the combination pseudo-experiments, as a function of generated 
input top quark mass. 



TABLE V: Summary of measured top quark masses for the Neutrino Weighting Algorithm (NWA), 
Full Kinematic Analysis (KIN), and Neutrino cf> Weighting Method (PHI), along with their statis- 
tical (total) correlations, and contributing weight to the combined top quark mass result. 



method 


result (GeV/c 2 ) 


NWA 


correlation 
KIN 


PHI 


weight 


NWA 
KIN 
PHI 


170.7±|;§ (stat.) ± 4.6 (syst.) 
169.5l££ (stat.) ± 4.0 (syst.) 
169.7±|;g (stat.) ± 4.0 (syst.) 


1.00 (1.00) 


0.14 (0.32) 
1.00 (1.00) 


0.25 (0.40) 
0.35 (0.46) 
1.00 (1.00) 


47% 
38% 
15% 
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IX. SUMMARY 



We have performed three separate measurements of the top quark mass from ti events 
produced in pp collisions at a center-of-mass energy of 1.96 TeV, using the Run II Collider 
Detector at Fermilab. The mass measurements employ one of two complementary selection 
algorithms to extract it events where both W bosons from the top quarks decay into leptons 
(ez/, fiv, or rz/), producing data samples of 340 and 360 pb _1 . Each measurement technique 
determines a single top quark mass for an event by making different assumptions in order to 
resolve the underconstrained dilepton ti decays. For each method, template mass distribu- 
tions are constructed from simulated signal and background processes, and parameterized to 
form continuous probability density functions. A likelihood fit incorporating these parame- 
terized templates is then performed on the data sample masses in order to derive a final top 
quark mass. 

One method, the Neutrino Weighting Algorithm (NWA), measures a top quark mass of 
170.7^5 (stat.)±4.6 (syst.) GeV/c 2 . A second technique, called the Full Kinematic Analysis 
(KIN), results in a mass measurement of 169. 5^72 (stat.) ± 4.0 (syst.) GeV/c 2 . A third 
analysis using the Neutrino Weighting Method (PHI) measures a value of 169.7ljj;[j (stat.)± 
4.0 (syst.) GeV/c 2 . Accounting for correlations in the statistical and systematic uncertainties 
between methods, we combine the three results, giving a top quark mass in the dilepton 
channel of 170.1 ± 6.0 (stat.) ±4.1 (syst.) GeV/c 2 . This combined result is consistent 
with the CDF Run II "lepton+jets" channel J29J], which used a 318 pb _1 data sample to 
measure a top quark mass of 173.5lgg GeV/c 2 , and thus gives no indication of new physics 
in the dilepton channel. The three template analyses are also consistent with a fourth CDF 
top quark mass measurement in the dilepton channel j^], which applies a matrix-element 
technique to the 33-event DIL selection sample. 
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